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Linear vector waves, both quantum and classical, experience polarization-driven bending of ray 
trajectories and polarization dynamics that can be interpreted as the precession of the “wave spin”. 

Both phenomena are governed by an effective gauge Hamiltonian, which vanishes in leading-order 
geometrical optics. This gauge Hamiltonian can be recognized as a generalization of the Stern- 
Gerlach Hamiltonian that is commonly known for spin-1/2 quantum particles. The corresponding 
reduced Lagrangians for continuous nondissipative waves and their geometrical-optics rays are de¬ 
rived from the fundamental wave Lagrangian. The resulting Euler-Lagrange equations can describe 
simultaneous interactions of N resonant modes, where N is arbitrary, and lead to equations for the 
wave spin, which happens to be a ( N 2 — l)-dimensional spin vector. As a special case, classical equa¬ 
tions for a Dirac particle (N = 2) are deduced formally, without introducing additional postulates 
or interpretations, from the Dirac quantum Lagrangian with the Pauli term. The model reproduces 
the Bargmann-Michel-Telegdi equations with added Stern-Gerlach force. 


I. INTRODUCTION 

The geometrical-optics (GO) approximation is widely 
used to model waves in very diverse contexts, which 
range from quantum particle dynamics to electromag¬ 
netic (EM), acoustic, and even gravitational phenomena 
M. As it is well known, GO is a theory asymptotic with 
respect to a small parameter, e, that is a ratio of the wave 
relevant characteristic period (temporal or spatial) to the 
inhomogeneity scale of the underlying medium. Absent 
resonances, just the lowest-order (“eikonal” Q , or “e°”) 
approximation is applied most commonly, within which 
a wave exactly matches one of the local eigenmodes at 
any given location; then vector waves can be treated 
exactly like scalar waves [3|. However, this e° model 
misses important physics when e is not vanishingly small 
or, alternatively, at large enough times. By including 
the first-order corrections in GO, one finds polarization- 
driven bending of ray trajectories , and one can 

also describe both adiabatic and diabatic mode conver¬ 
sion at resonances eeei. These e 1 -effects were studied 
extensively in applications to quantum particles and EM 
waves EH- However, the existing theories remain ad hoc 
(e.g., are restricted to transverse waves in media with 
no spatial dispersion) and also cannot describe resonant 
coupling of more than two modes, if at all. Hence, they 
cannot treat many effects that are both relevant and im¬ 
portant, particularly those in warm plasmas El- The 
existing eGtheories also have not quite unified relativis¬ 
tic quantum and EM effects beyond qualitative analogies 
and simple cases EH, EH • Those are long-standing prob¬ 
lems [13[; however, the recent development of a universal 
axiomatic description of nondissipative linear waves E2 
makes them potentially solvable. 

Here, we develop a general first-principle e 1 -theory of 
resonant nondissipative vector waves. We start with the 
fundamental representation of the wave Lagrangian den¬ 
sity (LD), Eq. ©, decompose the field into local eigen- 
modes, and then simplify the resulting LD by neglecting 
terms of the second and higher orders in e. The reduced 


LD that is obtained describes some N eigenmodes of the 
underlying medium, which are coupled through an effec¬ 
tive “gauge Hamiltonian” of order e. We consider waves 
with frequencies in a narrow enough range, O(e), around 
the central frequency; in this case, the gauge Hamilto¬ 
nian can be expressed as a function of coordinates and 
the central wave vector. As such, it serves as a gen¬ 
eralization of the Stern-Gerlach (SG) Hamiltonian that 
is commonly known for spin-1/2 quantum particles. We 
then show how to parameterize the LD in order to obtain 
dynamical equations for continuous GO waves and also 
for their rays. These equations describe both adiabatic 
and diabatic mode conversion of all N resonant modes 
simultaneously. We also show that the dynamics of the 
TV-dimensional complex polarization vector can be rep¬ 
resented as the precession of a real ( N 2 — l)-dimensional 
fixed-length vector, which is interpreted as the wave spin. 
(For the special case when N = 2, a similar vector has 
been known as pseudospin, or Stokes vector EE EH-) 

To illustrate our theory, we also apply it to derive 
a classical model of a Dirac particle, which by itself 
has been a long-standing and controversial problem; see 
Refs. EH2 for reviews. We show that our formula¬ 
tion leads to the well known Bargmann-Michel-Telegdi 
(BMT) equations EH, EH but with added SG terms, 
which are typically missed in straightforward semiclas- 
sical theories EE EE • Our calculation is the first one 
that formally deduces these corrected BMT equations 
from quantum theory without postulating any quantum- 
classical correspondence except the GO limit. (A more 
detailed comparison with earlier theories is given in 
Sec. lVIILDl l This part of the presented research also can 
be considered as a follow-up to our Ref. I27j , where the 
correspondence between quantum and classical LDs was 
studied in application to spinless and Pauli particles. 

Overall, the advantages of our new theory are as fol¬ 
lows: (i) The theory is derived in a variational form, 
so the resulting equations are manifestly conservative, 
(ii) The theory assumes no specific wave equation; hence, 
the equivalence between spin effects in quantum and clas¬ 
sical waves is automatically made quantitative, (iii) The 
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theory is naturally suited to serve as a stepping stone for 
studying ponderomotive effects on vector waves, in con¬ 
tinuation of the recent Ref. jUJ, where studies of such 
forces on scalar waves were initiated. In particular, it ren¬ 
ders possible a first-principle calculation of spin correc¬ 
tions to ponderomotive forces on electrons, which prob- 
lem has been enjoying much attention recently [29H34| . 
However, reporting this and other applications of the for¬ 
malism presented here is left to future publications. 

The paper is organized as follows. In Sec. [II] we de¬ 
fine the basic notation. In Sec. nm we present a general 
formalism describing vector waves. In Sec. IIVI we de¬ 
rive a general expression for the reduced LD of a wave, 
but its specific parameterizations are left to the follow-up 
sections. In Sec. El we discuss the leading-order approx¬ 
imation, in which the LD is parameterized as a scalar 
wave and leads to the standard equations of leading-order 
GO. In Sec. ED we discuss a more precise model yet 
treat polarization effects as a perturbation. In Sec. 1VIII 
we present self-consistent fluid and point-particle mod¬ 
els. In Sec. IVIII1 we apply our theory to a Dirac particle 
and compare our model with related theories. In Sec. m 
we place our results in the wider context of general wave 
studies. In Sec. m we summarize our main results. 


II. NOTATION 


The following notation is used throughout the paper. 
The symbol “=” denotes definitions, “c. c.” and “h. c.” 
denote “complex conjugate” and “Hermitian conjugate”, 
respectively; also, Ijv denotes a unit N x N matrix, and 
hat (") is reserved for differential operators. We use nat¬ 
ural units, so the speed of light equals one (c = 1), and 
so is the Planck constant (H = 1). The Minkowski met¬ 
ric is adopted with signature (—,+,+,+), so, in partic¬ 
ular, d 4 x = dtd 3 :r. Generalizations to curved metrics 
are straightforward to apply E3- Greek indexes span 
from 0 to 3 and refer to spacetime coordinates, with 
x° corresponding to the time variable, f; in particular, 
d^ = d/dx^. Latin indexes span from 1 to 3 and denote 
the spatial variables, x l (except where specified other¬ 
wise); in particular, di = d/dx z . Summation over re¬ 
peated spatial indexes is assumed. (However, the sum¬ 
mation rule does not apply to mode indexes q and r.) 
Also, for a given matrix M. we define its Hermitian and 
anti-Hermitian parts as 

M H = \(M + M +), M A =^(M-Mt), 
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so M = Mh + iMa■ Note that iMa is anti-Hermitian 
while Ma itself is Hermitian. Also, in Euler-Lagrange 
equations (ELEs), the notation “6a denotes that the 
corresponding equation was obtained by extremizing the 
action integral with respect to a. Finally, the abbrevia¬ 
tions used in the text are summarized as follows: 


ACT 

action conservation theorem, 

BMT 

Bargmann-Michel-Telegdi, 

ELE 

Euler-Lagrange equation, 

EM 

electromagnetic, 

GO 

geometrical optics, 

LD 

Lagrangian density, 

SG 

Stern-Gerlach. 


III. BASIC EQUATIONS 
A. Lagrangian density and Hamiltonian 


The dynamics of any nondissipative wave is governed 
by the least action principle, <5A = 0, where A is the 
action integral, 

A = J £d 4 s. (1) 


For a linear wave, the function £, termed LD, always can 
be written in the following representation [17| . 


£ = 1 [ip\d t ip) - (d t ip ] )ip] - ip* 1 Hip, (2) 


which we call (the density of) the fundamental wave La¬ 
grangian. Here ip is a complex vector field (“state func¬ 
tion”) of some dimension N, ip' is its adjoint, and H 
is some Hermitian operator called Hamiltonian. In the 
Minkowski space assumed here, it also can be expressed 
as H = H(t , x, k), where H is some N x N matrix func¬ 


tion, x = x is the position operator, k = — i V is the wave 
vector (momentum) operator, and the standard coordi¬ 
nate representation is assumed m 

We will consider H linear in k. In some systems, such 
as a Dirac particle (Sec. I VIII j) . Hamiltonians have this 


form originally; in others, H can be made linear in k by 
extending the state function (Appendix[Al) or by expand¬ 
ing the true Hamiltonian around some large central wave 
vector E3- In any case, the most general representation 
of such (Hermitian) H can be adopted in the form 


H = A° h + (A- k) H (3) 


where A^ = A ll (t, x) are some N x N matrices, and A is 
a column comprised of A° . [In principle, terms of higher 
orders in k could be retained too, as a perturbation, but 
only if they remain less or comparable to the small ener¬ 
gies to be discussed below, such as U given by Eq. (EH).] 
Notice now that 

(A-k) H = l(A-k + k-A^) 

= 2 \(Ah + iAa) • k + k • (Ah — iAa)\ 

= \ {Ah • k + k ■ A h ) + ^ [A j a , -idj ] 

= ^ (Ah • k + k • Ah) ~ C. 


( 4 ) 
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Here [•, ■] is a commutator, and 

C = \[d j ,A j A ] = \v-A A (5) 

is a Hermitian matrix, as iAa is anti-Hermitian. Then, 

£ = ^ \ip^(d t ip) - (d t ip^)ip] - ip* Hip. ( 6 ) 

Here we introduced a Hermitian operator 

F=i[a-HV) + HV)-a] + A, (7) 

which serves as a new Hamiltonian, and also the following 
Hermitian matrices: 

a j =A H , A = A° h -C. (8) 


In addition to Eq. ©, the LD also has other equivalent 
representations. Below, we present some of them that we 
will need for our purposes. 

B. Super-Hamiltonian H 

First, it is convenient to rewrite Eq. © as follows: 

£ =—ip*(—id t + H)ip — d t (iip*ip/2). (9) 

Since the latter term yields zero contribution to the ac¬ 
tion integral ©, the LD can be equivalently expressed as 

£=-ip*Hip, n = -idt + H. ( 10 ) 

The operator H can be understood as the super- 
Hamiltonian and can be reexpressed as follows, 

^ = ^KK) + HW + a 

= a'*(-»a M ) + A-|(V-a), (11) 

where a 0 = 1^. The corresponding ELEs are 

5ip* : Hip = 0 ( 12 ) 

and the adjoint equation for ip*. 

C. Alternative form of £ 

Equivalently, Eq. © can be cast as follows: 

£ = ^ [ip*(d t ip) - (d t ip*)ip] + ip* a ■ Vip 
+ 2 ' (“VO — ip*Xip 

= \ bP^dtip) - {dtip f )4>] + ^ [ip* a • (Vip) 

— (Vip*) • aip] — ip*Xip + — V • (ip*aip). (13) 


Like before, one can omit the divergence term; then, 

£ = ^ [^(a^d^ip - c. c.] - ip*Xip 

= —Re [ip*(—ia^d^ip + A)i/>]. (14) 

This result also flows from £ = —Re (ip* Hip), since 
ip*i(v ■ a)ip is imaginary due to V • a being Hermitian. 

D. Problem outline 

Below, we will consider waves such that 

ip = e^, (15) 

where 6 is some rapid real phase (yet to be specified), 
and £ is a vector evolving slowly compared to 9. This 
implies the following: (i) a and A evolve slowly (if at all), 
and (ii) among all the N dispersion branches, there are 
only some N < N branches that are excited, which have 
local frequencies in some narrow enough range, A ui/ui = 
O(e) <C 1, around the central frequency 

ui = —d t 9. (16) 

We will call these branches “active” and, for clarity, 
assign to them indexes q = 1,..., IV. [The remain¬ 
ing, “passive” branches will be assigned indexes q = 
(N + 1),..., N, correspondingly.] The goal of the cal¬ 
culation presented below is to derive an approximate LD 
for the active modes that would be accurate up to 0(e 1 ). 

IV. REDUCED MODEL 

A. Exact eigenmode representation 

In general, there exist N eigenfrequencies uj q (k) corre¬ 
sponding to a given local wave vector 

k=V6>. (17) 

Those are found from the local dispersion relation, 

det[H(k) -Ijyu> q (k)} = 0, (18) 

i7(k) = a • k + A. (19) 

(The dependence on t and x is present too but will not be 
emphasized except when it is necessary.) Corresponding 
to w 9 (k) are some eigenvectors of H( k), which we denote 
as h q . Since H( k) is Hermitian, h q can be chosen such 
that they form an orthonormal basis, in which any £ can 
be decomposed as 

N 

9=1 


( 20 ) 
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The coefficients a 9 are scalar complex functions of x v . 
Typically, aj = (a 9 )*, but we prefer to distinguish aj 
and (a 9 )* to allow also for (somewhat exotic) negative- 
energy waves, which have a) q = — (a 9 )* and should not be 
confused with negative-frequency waves |35j. This also 
renders the notation more compact. 

Now let us express the connection between £ and a q 
in a matrix form, £ = Ha. Here a is a column vector (of 
length N) comprised of a 9 , and H is a NxN fundamental 
matrix, which has vectors h q as its columns. Also let us 
adopt the notation X = ip^ip and VV = ^ H(k)ip (these 
two functions are identified as the wave action density 
and the energy density, respectively 0), 

N N N 

X = a q h q h r a r = a q S q a r = a q a q = cl 1 a, 

q,r= 1 q,r=l q—1 


N N 

W = a\h q H{k)h r a r = ^ a\h q u r h r a r 

q,r =1 q,r=l 


N 


N 


^ alu r 5 r q a r = J2co q ala q . 


q,r= 1 


9=1 


Then, Eq. (fill) yields 

£ = —Re [i/j^ (—id t — ia ■ V + A) tp\ 

= —IdtO — VV — Re [a^H' l 'a M (—*3^)(Ha)] 

= —Xd t 0 — VV — Im [a^Z^a ,J 'd ll ,(Zd)] 

= -id t e-w + ic, ( 21 ) 

where we introduced 

1C = ^ [a t (H t a M H)9 (U a - c. c.] 

+ |[atHt^(^H)a-c.c.]. (22) 


B. Reduced eigenmode representation 


Since passive modes have nonzero amplitudes only due 
to the inhomogeneity of the medium, one has 


O(e 0 ), q=l,...,N 
O^ 1 ), q = (N+l),...,N 


Then, up to an error 0(e 2 ), one gets 

N 

X ~ aja 9 = a^a, 

9=1 

N 

vv « ^2 uj<la \ aq — a ^£' 


(23) 


(24) 


9=1 


where a 9 = a 9 for q = 1,... ,N, a is a column vector 
comprised of all a 9 , and £ is a diagonal NxN matrix 
with eigenvalues w 9 , 

£ = diag (w 1 , w 2 ,..., w N ). (26) 

It is sufficient to keep only the active-mode contribu¬ 
tion also in 1C, since both d^a and d^Z are already of 
order e or less. Thus, within the accuracy 0(e 1 ), passive 
modes do not contribute to £ at all and can be omitted 
entirely. Instead of H, we hence can use its corresponding 
projection, H, which is a N x N (i.e., nonsquare) matrix 
that has first N vectors h q as its columns. This gives 

£ = —Id t 9 + 1C — a)(£ — U)a, (27) 

where 

K= l - [at(Hta^H)(^a) - (d^^a^a], (28) 

U= l -[ Ht^(^H)-(a M Ht)a^H]. (29) 

C. Group velocity and convective derivative 

To calculate 1C and U with an error less than 0(e 1 ), 
it is enough to calculate H within the accuracy O(e 0 ). 
Then, the difference in the active-mode frequencies can 
be neglected (i.e., w 9 ~ w), and H approximately satisfies 

0 = T-LqZ = (a ■ k + A — I^w)H. (30) 

Differentiating Eq. d30D with respect to k gives an equa¬ 
tion for the central group velocity, Vq = duj/dki ; namely, 

a'H + (a • k +A - - t;oH = 0, (31) 

which leads to 

(I N v l 0 -a i )Z = H 0 ^-. (32) 

Let us multiply Eq. (1321) by Hi. Due to orthonormality 
of h q , we can substitute H^H = ly, and, due to Eq. (l30l) . 
we also can substitute Z^XLo = 0. That gives 

Itvv 0 = Z^aZ (33) 

(in which sense a serves as the group velocity operator). 
Therefore, 

H t a M H dfj = Z^Zdt + ^ccH • V = Ijv(<9t + v 0 • V) = d t 

can be understood as a convective derivative associated 
with velocity vo- [As will be shown below, Vo differs 
from the true average velocity, V, by O^e 1 ).] Hence we 
can simplify the expression for K. as follows: 

/C = ^ [a\d t a) - (d t a^)a\. 


(25) 


(34) 
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D. U as a function of (t, x, k) 

Now let us search for a tractable expression for U. To 
do so, consider 5 as a function 5(t,x, k). (Below we 
assume these arguments by default.) Then, 


where the former term on the right is understood as the 
derivative of 3 with respect to x M at fixed k (and t). 
Substituting this into Eq. (l29l) yields 


d M Z(t,x,k(t,x)) =-^ + ^-d M ki, (35) 


U 


sV 


<95 05* 

dx 11 dx 11 




+ 


5s 


as* 




= K H, “" £ - II °' H ) + 5 ( Ht S - £ H ) ^ + K sW I - S ^ 1 d ’ k " <"> 


Now recall that, by definition, k satisfies the so-called consistency relation, 


d t h = df t 6 = dlfi = -diw(t, x, k(f, x)) = - 


5w(f,x, k) 5w(f,x, k) 


dx l 


dki 


dikj. 


(37) 


[Here and further, df t = d 2 /(dx l dt), etc.] Using also that dikj = d 2 t 0 = d 2 jd = djki , one gets 

i 


U= l - - P~a»Z 

2 V dx 11 dx^ 


dZ 55^ \ 5u;(i, x, k) iR 

2 V dki dkj ) N dx i 2 


(38) 


„ + 5S a»t „ 
dkj dkj 


R = ( 3*-^- — H— 3 ) Ijv^o 5 2 # — ( 5W oPZ ) 5?,-0 


55_5St 
dL dki a 




dZ 55+ 


fll. 07.. “ ) ^ Wl, 0 ( ^ a 07„ 07, 


53 ast 




dki dki 


5fci 5/cj 


Ki 9 


5+ar 7 ' 


as ast 

5fc,- dki 


9fJ> 






(39) 


But, from Eq. 


we know that 


O' 

^(IjvUq — a' 7 ) = -q^ - T~Lo, (IatUq — q;*)3 = Hq 


dh ' 


(40) 


Therefore, R = 0, so the resulting U can be rewritten as follows: 

U = i (v t a M _^L _ du(t,x*) 

2 V“ dx» dx» “/ 2 1“ 5fci dk, “7 5a 4 


-1 


55 55t 

2 \ dt dt 

1 / f 55 55* „ 

2 \ 5t dt 

i / + 5E 


SV 


CTt„* 


55 551 

5x z 5x J 
55t 




5fc,; 


* 55 55+ \ 5w(f, x, k) 

2 \ dki dki J dx l 

i 55 55+ \ 5w(t, x, k) 


55 551 / 5! 

Uo ) — - — ( 3«S - Ho 


dt 


i i 9: 


= - ( 5+ — h. c. ) + - [ S+Un tt— — h. c. ) — - ( 5+ —— h. c. 


dx i 


dx l dx l 
i / 4 . 5E 


dh 


dki dki 


dx * 


dh 


5w(f,x,k) i ( d a* 


dx i 


5E 


dki n ° dx i h ' C ' 


= [H< (3« + v„ • V - T» ■ 4) H - h. c.] + (j|l Wo ^ - h. c. 


(41) 


r 


where we substituted corollaries of Eqs. (E§, 

5V = Iw^-f^o, a i Z = I N v i 0 -n 0 ^ ; . 


Equation (1411) can be rewritten more compactly as fol¬ 
lows. Notice that vq and —Vcu are components of the 
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phase flow velocity in the ray phase space, as determined 
by the zeroth-order (in e) ray equations, 

x(t) = d k u(t, x, k), k (t) = -d x u(t, x, k). (42) 

Thus, it is convenient to introduce the convective deriva¬ 
tive in the ray phase space, or the Liouville operator, 


= d t + x(t) • d x + k(i) • d k E(t, x, k). (43) 

Then, Eq. (TUI) becomes 


[ 7 (t,x,k) = l[st(P t S)-(P t S)tH] 


i 



/5St ^ 
^ a x 0 ak 


ast as 
ak 0 a x 


(44) 


The importance of Eq. (1441) is that it guarantees U to 
be a function of (t 1 x, k) (and not of the gradient of k, 
the dependence on which on has been eliminated when 
we proved that R = 0). Due to this property, —U can be 
identified as an effective “gauge Hamiltonian” [36} with 
a well defined classical limit. In application to specific 
systems such as atoms and guided EM waves with sim¬ 
ple dispersion, related potentials were also studied, for 
instance, in Refs. l37H39j . In application to EM waves 
propagating in weakly inhomogeneous isotropic dielec¬ 
tric media, U serves as the Hamiltonian of the interaction 
between the photon spin (polarization) and the photon 
orbital motion [jj}- Also, for EM waves in weakly 
anisotropic media, U determines the interaction between 
the photon spin and the pseudo-magnetic field that is 
effectively caused by the medium’s anisotropy EMU- 
For waves and particles propagating in rapidly oscillat¬ 
ing backgrounds [2^, U is also recognized as minus the 
ponderomotive energy. Finally, for quantum particles, U 
serves as (or is related to) the SG Hamiltonian. This will 
become clear from Secs. IVIII and IVIIII after we discuss 
the ELEs that flow from the reduced Lagrangian derived 
here. 


E. Lagrangian density: summary 

Let us split the total new Hamiltonian, £ — U, into the 
average energy, 

Ho(t 7 x, k) = 7V -1 Tr(£ — U), (45) 

and the following traceless matrix, 

Q(t, x, k) = £ — U — InHq. (46) 

(Below, we omit Ijy for brevity.) Then, Eq. (1271) can be 
rewritten as follows, 

£ = - [a\d t a) — {d t a*)a\ — ( d t 8 + H 0 ) a*a — a^Cla. 

(47) 


Since k = V0, both Hq and Q are functions of (f, x, V0), 
and so is Vo; then, 

d t = d t + v 0 (t,x, V8) • V. (48) 


Notice also that 

^ [a\d t a) - (i d t at)a] = [a\d t a) - {d t a^)a\ 

+ \ t atv o • (Vo) - (Vo f ) • v 0 a] 
= ia^dt.a + — (V ■ vo)a^a 
- l -d^a ] vSa), 

where Vq = 1. Since the divergence term can be dropped, 
this leads to the following equivalent form of the LD: 


£ = a* 


idt + ^ (V ■ v 0 ) - (d t e + H 0 ) - fi 


(49) 


Equations (TT5]) ~ (TTD1) . in combination with Eqs. (ITUl) and 
(1771) for U and with Eq. (1771) for vo, are the main results 
of this section. Below, we discuss how to apply these 
results in various cases. 


V. SCALAR-WAVE LIMIT 

To the lowest, zeroth order in e, the LD can be approx¬ 
imated simply with 

£ = —{dtO + Ho)a l a = —{dtO + Hq)I 1 (50) 

where Hq also can be replaced with £. One may recognize 
this as the standard LD for a scalar wave in Hayes’s form 
[40| . which corresponds to the e°-theory. Such LD is pa¬ 
rameterized by just two independent functions, the rapid 
phase 8 and the total action density I. The correspond¬ 
ing ELEs are the action conservation theorem (ACT), 

58: dtl + V • (Xv 0 ) = 0, (51) 

and a Hamilton-Jacobi equation, 

52: d t 8 + H 0 (t 7 x, V0) = 0, (52) 

which can be understood as the local dispersion relation, 

w = L/o(f,x,k) (53) 

[cf. Eqs. (1171) and (1171) ]. The resulting dynamics do not 
need to be discussed here, since it is exhaustively covered 
in literature. For an overview, see, e.g., Refs, [lj, y|. 


VI. PRESCRIBED 8 
A. Basic equations 

Now let us include terms in £ of the first order in e. 
Postponing a comprehensive discussion until Sec. IVIII 
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here we outline polarization effects in a simplified man¬ 
ner, namely, as a perturbation to the scalar dynamics de¬ 
scribed in Sec. GO Within this approach, we can choose 9 
(which has been unspecified so far) as a prescribed func¬ 
tion satisfying Eq. (1521) . Then Eq. (Id9l) becomes 


£ = a* 


id t + - (V • v 0 ) - 


(54) 


where the vectors a and ad can be adopted as independent 
variables. (The latter would have not been possible if # 
were itself to be found; see Sec. 1VIII ) This leads to the 
following ELE, 


5a? : 


idta = f la — — (V • Vo )a, 


(55) 


and the adjoint ELE for ad. As a corollary, the following 
equation for I is yielded, 

dtT = a^dta + c. c. = —(V • vq)X, (56) 


which is equivalent to Eq. ©. Hence, one can simplify 
the motion equation by introducing normalized variables, 

b = I -1/2 a, tfb = 1. (57) 

The new variables satisfy 

xd 

idtb = I~ 1 ^ 2 idta — ^ b = Qb. (58) 

This equation shows that, in the frame traveling with 
velocity Vo, the vector b rotates with matrix frequency 
H, and W = cdfla serves as the mode-coupling energy. 


B. Wave spin 

Let us also describe the rotation of b as follows. As 
a traceless Hermitian matrix, Q can be represented as a 
linear combination of N 2 — 1 generators T u of SU(iV), 
which are traceless Hermitian matrices, with some real 
coefficients — W u [4l| : 

N 2 -1 

n = - TuWU = TW (59) 

u =1 

It is instructive to introduce the (N 2 — l)-dimensional 
vector S = b^Tb, so that W = Z(f2), where (fl) = —S-W. 
The components of S satisfy the following equation: 

d t S w = tfT w (d t b) + (d t b^)T w b 
= ib^flT w b — ib^T w flb 
= itf(QT w -T w fl)b 
= itf[n,T w \b 
= itf{T w ,T u }bW u 

= —fwuv{b^T v b)W u , 

= f W vuS v W u , (60) 


where f wuv are structure constants, which are antisym¬ 
metric in all indexes pH] ]. 

For example, consider the case when only two waves 
are resonant. Then, N 2 — 1 = 3, T v are the three Pauli 
matrices divided by two (so |S| 2 = 1/2), and f wuv is the 
Levi-Civita symbol, so f W v U S v W u = (Sx W) w . For a 
Dirac electron, which is a special case (Sec. IVIIII) . such 
S is recognized as the spin vector undergoing the well 
known precession equation, dtS = S x W. (One may 
also recognize this as an equation for the Stokes vector 
that was derived earlier to characterize the polarization 
of transverse EM waves in certain media [§j, Enao 
Hence, it is convenient to extend this quantum termi¬ 
nology also to N waves. We will call the corresponding 
(TV 2 — l)-dimensional vector S a generalized spin vector 
and express f WV uS v W u symbolically as (S * W) w , where 
* can be viewed as a generalized vector product. Then, 
Eq. (1601) is rewritten compactly in the following vector 
form, 

d t S = S * W, (61) 

and is understood as a generalized precession equation. 


C. Applicability of the prescribed-# model 

In some cases, the prescribed-# model can be entirely 
sufficient. For example, suppose both Ho and fl are con¬ 
stant. Suppose also that 9 changes only along a single 
axis, x , and is independent of transverse coordinates, p. 
(An example might be the case of a charged quantum 
particle traveling parallel to a constant magnetic field.) 
Then £ can be integrated over p, so a Lagrangian lin¬ 
ear density is yielded that describes the one-dimensional 
motion along x. Due to f1 being constant and small, b 
oscillates at the same small rate everywhere and thus re¬ 
mains slow compared to 9. Then the prescribed-# model 
is valid indefinitely. 

However, if H is inhomogeneous, phases of b at dif¬ 
ferent locations grow at different rates. Then, even 
though fl is small, the assumption of slow £ is eventu¬ 
ally violated, and hence the prescribed-# model cannot 
be trusted. (See also Sec. I VIII Dl ) Equations that remain 
accurate on larger time scales, therefore, can be derived 
only within a model where the rapid phase # is calculated 
self-consistently. Such model is discussed below. 


VII. SELF-CONSISTENT 6 
A. Fluid model 

Let us adopt (#,I) as independent variables, like in 
Sec. E] The remaining variable, 6, can be parameterized 
with the (TV— 1) spherical angles C, r on the V-dimensional 
sphere and some (N — 1) relative phases ■# <J of individual 
b q ; however, the remaining, Nth phase is not indepen¬ 
dent. To account for that, this time we define # such 











that, for example, a N be real. This leads to a new set of 
normalized variables, 77 . (Like 6 , the vector 77 is defined 
via a = X 1 / 2 77 . However, we distinguish b and 77 because 
the underlying definitions of a is different due to the fact 
that the corresponding 9s are defined differently.) The 
components of 77 can be parameterized as 

7 ? « = ^(C 1 ,...,C JV - 1 )x|^ 9 ’ . (62) 

Here are the well known real functions parameterizing 
the location of a point on a unit sphere, 


Hamilton-Jacobi equation. — The second ELE, 


N-l 

SI : d t 9 + H 0 + (n)-J2 ( $9 ) 2 d t# q = 0 , ( 75 ) 

9=1 

can be considered as a generalization of the Hamilton- 
Jacobi equation to vector waves. 

Equation for 9. — Another set of (N — 1) ELEs is 


SC : 


<9(H) 

dC 


d (4>«) 2 

£ Q£ r 


d t d q = 0 . 


(76) 


■h 1 = cos(C 1 ), 

$ 2 = sin(C 1 ) cos(( 2 ), 

$ 3 = sin(^ 1 ) sin(^ 2 ) cos(( 3 ), 


(63) 

(64) 

(65) 


Equation for (,. — Another set of (N — 1) ELEs is 
6V r : S t [Z(<F) 2 ] + V • m r ) 2 vo] + 2 :® = 0 . (77) 


fc *" 1 = sin(C 1 )... sin(C Ar “ 2 ) cos(C Ar “ 1 ), ( 66 ) 
= sin(C 1 )... sin(C Ar “ 2 ) sin(C Ar_1 ), (67) 

so J ^£($ 9 ) 2 — 1- Notice that the LD in Eq. (H71) is 
then expressed as 


Each of these equations describes the evolution of the 
action of an individual (rth) mode, Z(<f> r ') 2 . By using 
Eq. (1701) . one can rewrite Eq. m also as follows: 


I(d t + v • V)($ r ) 2 + v • [X($ r ) 2 (v 0 - V)] 


-x^M = o. 

dd r 


(78) 


£ = -I{d t 6 + H 0 + (H)) + — [77 f (rft77) - (68) 

or, more explicitly, 

£ = -X [d t 0 + H 0 (t, x, VO) + x, VO , C)] 

N-l 

+ lJ2 [^(C)] 2 (5 t +v 0 - V)7? 9 , (69) 
9=1 

where the derivatives of C canceled out. (Note that we 
use 9 to denote the whole set of N — 1 variables 9 r , and 
similarly for £.) This leads to the following 2N ELEs. 

Action conservation theorem. — The first ELE is the 
ACT, 


Note that, in the case of a localized wave packet, averag¬ 
ing over the packet area eliminates the divergence term 
and predicts the advection of (<f > r ) 2 at velocity V. 

Combined together, the 2N equations derived in this 
section can be viewed as a generalization of the classical 
spin-fluid equations that we earlier derived j27| for a Pauli 
particle (N = 2). The generalization consists of the fact 
that the new equations apply to general waves (e.g., clas¬ 
sical EM waves), as opposed to specific waves of quantum 
matter. In particular, the spin is a ( N 2 — l)-dimensional 
vector now, in contrast to the three-dimensional spin of 
a Pauli particle. 

B. Point-particle model and ray equations 


59: d t I + V • (XV) = 0, (70) 


which is a continuity equation for X. The corresponding 
flow velocity is V = v + u, where 


v = <9 k Ho + <«9kSX>, 


JV-l 


du 




9=1 


(71) 

(72) 


Notably, one can also recast the latter formula as u = 
M^ 1 • k, where M is understood as the mass tensor of a 
wave quantum, and k is the wave vector of 77 ; namely, 


To the extent that a wave packet is well localized such 
that it is meaningful to describe its dynamics as the dy¬ 
namics of the packet’s geometrical center, the continuous- 
wave description developed above can be replaced with 
a simpler, point-particle model. In this case, one can 
approximate the action density with a delta function, 

X(i,x) = 5(x-X(f)), (79) 

so the LD can be replaced with just a point-particle La- 
grangian, X = f £ cl 3 x. Following the same approach as 
in Ref. [27}, one obtains 


1VT 1 = 


d 2 co 

dkdk' 


K = ivH'Vv) - (V 77 ) 777 ] = -itfVr). 


(73) 

(74) 


L = P • X — H 0 (t, X, P) — <fi) (t, X, P, 9, C) 


N-l 

+ £[* 9 (C)] 2 ^, (80) 

9=1 
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where P(f) = 

V0(f, X(t)); the angles ft and ( are also 

evaluated at x 

= x(t). 

. The corresponding ELEs 

are 

<5P 

: X = 

dp [H 0 

+ m, 

(81) 

SX 

: P = 

-c*x[tfo + (ft>], 

(82) 

sc 

N—l 

■ E 

7=1 

<9(4 >«) 2 

&C 


(83) 

Si9 r 

N-l 

■ E 

q= 1 

a ($ r ) 2 

dC 


(84) 


[Notably, (f2) can be viewed as the Hamiltonian for (i9, £), 
but the equations for (i9, () have a non-canonical form. 
In contrast, the equations for X and P are canonical.] 
These equations can also be considered as ray equations 
for the fluid equations derived in Sec. IVII Al 

As before, an equation for the spin vector flows from 
the ELEs as a corollary. At least for a Pauli particle, this 
is easily shown by a straightforward calculation; see also 
Ref. [27]. However, instead of rederiving the spin equa¬ 
tion from scratch, let us propose the following argument. 

C. Complex representation 

Instead of (i9, (), one can also use as indepen¬ 

dent variables, if one requires explicitly that 77 be con¬ 
strained by 77^77 = 1 and r^ N = r] N . The constraints can 
be implemented by introducing two Lagrange multipliers, 
]i and iu, so that the resulting Lagrangian becomes 

L = P X - H 0 ( t , X, P) + l - (77+77 - 17+17) 

- 77 + 0 (t, X, P )r] + 714(77+17 - 1 ) + iv { r)lf - V N ). ( 85 ) 
The corresponding ELEs are as follows: 


S VI<N 

iff = (flr]) q + 7 irf , 

( 86 ) 

Sr] q< N 

-iff q = ( 77 +n), + 77 In, 

(87) 

Sr] ] N 

if] N = (Qr]) N + ]it] N — iu, 

( 88 ) 

Sr]N 

-iflN = (?7 f ^)iv + VnU + iv, 

(89) 

S]i 

77+77 — 1 = 0 , 

(90) 

Sv 

o' 

II 

H-fe: 

(91) 

<5P 

X = dp(H 0 + ^flr]), 

(92) 

SX 

P = -d x (H 0 + ^Q V ). 

(93) 


By comparing Eqs. (1861) and (l87l) . it is seen that 7 1 must 
be real; then, Eqs. (|89l) - (l9Tll show that u is real too 01. 
Now let us introduce U = fl + /jIn to write 

1l q = —i(Ur)) q - vS q ,N, (94) 

where S qi N is the Kronecker symbol. This leads to 

A ( 77 + 77 ) = -2ur] N , (95) 


since U is Hermitian. But, from Eq. (l90l) . we know that 
d(ryrj)/dtt = 0. Since r] N cannot remain zero identically, 
this leaves us with u = 0, so the entire vector 77 satisfies 

if] = Ur], (96) 

Notice, however, that these ELEs do not form a closed 
system, as there is no independent equation for ]i. The 
issue can be evaded by rewriting the ELEs in terms of 

z(t) = r](t) exp J ]i(t') d. (97) 

According to Eq. m, z satisfies 

iz = flz. (98) 

We then obtain, like in Sec. IVI B1 that 

S = S*W, S = z+Tz. (99) 

Equations m and (l93l) can be expressed through S too, 

X = dpH Q - S • <9 P W, (100) 

P = -3 x Ho + S • 3 X W, (101) 

where Hq and W are functions of (t, X, P). 

In contrast to the ELEs, Eqs. (I59l) - (I101I) do form a 
closed system. It is also seen now that the effect of po¬ 
larization on the wave ray, or point-particle, dynamics 
is akin to that of the SG Hamiltonian on a Pauli parti¬ 
cle [131 [except, in general, the vector S is a ( N 2 — 1)- 
dimensional]. Also, it is to be noticed that this general¬ 
ized SG Hamiltonian can contribute to the expression for 
the particle velocity, X, so the latter is not necessarily 
equal to the average group velocity, dpH^. 

Notice, finally, that the resulting equations for vari¬ 
ables (X, P, z\ z) also can be assigned a Lagrangian, 

L = P • X — Ho(t, X, P) 

+ — (z+i — C z) — z+fl(f, X, P)z, (102) 

assuming that the initial conditions are restricted to 
z+z = 1. Specifically, the corresponding ELEs are 


dP : 

X = dpH 0 - S • <9pW, 

(103) 

SX : 

p = -d x tfo + s • a x w, 

(104) 

Sz + : 

i = —ifiz, 

(105) 

Sz : 

C = i(fiz)+ 

(106) 


and yield the spin equation (l99l) as a corollary. But keep 
in mind that these equations [as opposed to Eqs. m- 
(1511) ] do not allow finding the full vector 77 , because 77 in 
the variable transformation (1971) remains unknown. 
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VIII. EXAMPLE: DIRAC PARTICLE 

In this section, we apply the above formalism to a Dirac 
particle considered as an example. Our goal is to obtain a 
first-principle Lagrangian e^theory that describes Dirac 
particles as point particles with spin. In Sec. IVIII D1 we 
will also discuss how our theory relates to other existing 
theories of classical and semiclassical Dirac particles. 


A. Basic equations 


First, let us introduce the commonly known Dirac LD, 

£ = - qA^) + j3m]ip. (107) 

Here q and m are the particle charge and mass (the as¬ 
sumed sign convention is that q < 0 for electrons), A^ is 
the four-vector potential, a and /3 are the corresponding 
Dirac matrices, 


a 


o 





I 2 is a 2 x 2 unit matrix, and er are the Pauli matrices. 
(The Dirac gamma matrices and the Dirac adjoint do not 
need to be introduced for our purposes.) This £ can be 
cast in the form ([ 6 ]). This is done by adopting 


A = — qa ■ A + /3m, (108) 


while the Dirac alpha matrices serve precisely as the al¬ 
pha matrices of our theory (but now they are constant). 

The total number of modes is N = 4, and the frequen¬ 
cies u q (corresponding to the limit e = 0 ) are 


w 1 = w 2 = oj + = +£ — qAo , (109) 

w 3 = w 4 = oj- = — e — qA 0 . (HO) 

Here e is the kinetic energy, introduced as 

e(p) = \/m? + p 2 , (HI) 

p is the kinetic momentum, introduced as 

P = P-qA, (112) 


The effective potential U can be calculated straightfor¬ 
wardly using Eq. ([29]) . To calculate d^Z entering that 
equation, let us consider 5 as a function 


E(i, x) = H(e(i, x), p(f, x)). (114) 

Then, U = —{Vt + V x + Qt + Qx), where (Appendix IB 11) 


V t = Im ( Si d t pj ) = 


sV 


d Pj 
dZ 
dPj 


V x = Im 
Qt = Im 

Q x = Im ( -Ip 1 ^ di£ ) = 


v x d t p 
2 (m + e) 

qB 


E'||a £ )=o, 

d~ 


v x Ve 

2 (m + e) 


(115) 

(116) 

(117) 

(118) 


Here, E and B are the electric and magnetic fields, and 
v = p/e is the particle velocity (unperturbed by the spin 
coupling). Since 


Ve « V (~d t e + qA 0 ) 

= —dt'Vd + qVAo 
= - dt{ve - qA) ^ qdt A + qVA 0 
= -dtp + q (~d t A + VA 0 ) 

= -d tP + qB, (119) 


one can rewrite Q x also as follows, 


Qx 


v x d t p 
2 (m + e) 


qv x E 
2 (m + e) 


( 120 ) 


Then, the terms proportional to <9tp cancel out (as ex¬ 
pected from Sec. HVDl . and one arrives at 


U = er 


qB 

2 e 


qv x E 


2 (m + e) 


= -<T W, 

2 


( 121 ) 


where we introduced 


w ^ qv X E = q_ 

e m + e mj 


B - 


7+1 


v x E , (122) 


and the standard notation P = k is introduced to denote 
the canonical momentum. (In our notation, P and k do 
not need to be distinguished because h= 1 .) 


B. Effective potential 


For clarity, let us consider the case where there are 
only two active modes (N = 2 ), specifically, those cor¬ 
responding to spin-up and spin-down particle; hence, no 
antiparticles are considered. Then, as it is well known [9|, 


m + e 
2e 


I 2 

o-p 

m+e: 


(113) 


and 7 = e/m is the particle Lorentz factor (unperturbed 
by the spin coupling). Since U is traceless, we obtain 

Q = -U = ~( t - W, H 0 =u+. (123) 

The corresponding fluid and point-particle equations are 
hence derived as described in Secs. |VT] and IVIII In par¬ 
ticular, the point-particle spin vector satisfies 

S = S x W, (124) 

so W, which is given by Eq. (112211 . serves as the spin 
precession frequency. This agrees with the Thomas pre¬ 
cession equation Also note that, Eq. (TT7I) with 
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Eq. (11231) taken in the nonrelativistic limit yields 


D. Comparison with other models 


£ = - [a\d t a) - (dta^a] - (d t 9)a^a 


— a 1 


2^ (™ - ? A ) 2 - g4) - ^ (<r • B ) 


a. 


Substituting here a = X 1 / 2 n leads to precisely the LD 
of a classical Pauli particle [23]. Thus, it also subsumes 
the classical limit of the Takabayasi equations 0 ], as 
discussed in Ref. 0 . 


C. Anomalous magnetic moment 


Now let us also consider the correction due to an 
anomalous magnetic moment. This correction is de¬ 
scribed by adding to £ the Pauli term 0 , 0 , 


£ = — 


~ g 

(l-i 

) 0a^F uu 

-4m 

\2 





(125) 


Here g is the g-factor, C+" = (i/ 2 )[ 7 ^, 7 "] is the rela¬ 
tivistic spin operator, 7 ^ = /3a 1 * are the Dirac gamma 
matrices, and F^ is the EM field tensor. Then, 


o^F gv = 2 


(-a ■ B 
y icr • E 


icr ■ E \ 
-ct-B )’ 


which leads to (Appendix IB 21) 


(126) 


^Pa^F^if « -2a f 


cr ■ B — (v x E) • cr 


7 


7+1 

Substituting this into Eq. (11251) . we get 


(<r ■ v)(B ■ v) 


£ = 


g 

2m 


(I- 1 ) 


— 1 a' era ■ 


B-vxE- 


7 + 1 


(B • v)v 


Then, we can express the contribution of the gauge 
Hamiltonian to the LD as 

a0Ua + £ = a^U g a, (127) 

where we introduced U g == cr ■ W ff /2 and 


w 9 = 


f-! + - 

2 7 


B - 

-(I- 1 ) 


7 


7+1 

7 

7+1 


v x E 


(B 


It is seen that the effect of an anomalous magnetic 
moment is described by replacing U with U g or, in other 
words, by replacing W with W 9 ; hence, the corrected 
spin precession frequency is W 0 . This leads to the well 
known BMT equation for S 0, 0], which is thereby 
seen to be subsumed by our theory as a special case. 
Yet, in contrast to the BMT theory, our formulation also 
captures the SG force [for example, see Eqs. (11001) and 
( 1 101 D ] and thus remains manifestly conservative also in 
the presence of field gradients. 


The sample application of our theory to a Dirac parti¬ 
cle discussed above can be viewed as a complement to the 
many spin-particle models yielded by an almost century- 
long research. The relevant literature is too extensive to 
be surveyed here, so we refer the reader to already exist¬ 
ing reviews, e.g., Refs. 0-0]. That said, let us briefly 
outline how our study fits the general context. 

1. — On the score of being manifestly Lagrangian, 
our theory is reminiscent of that by Barut et al [47H50l ] . 
However, in contrast to ours, the latter (i) is constructed 
axiomatically, rather than by deduction, (ii) does not 
capture the SG force (at least, manifestly), (iii) is solely 
a point-particle theory, rather than also a fluid theory, 
(iv) and relies on the concept of proper time, which 
is introduced ad hoc and in a debatable manner JdJ. 
Also importantly, Barut et aVs theory (as well as many 
other classical-spin theories) is formulated in terms of 
bispinors, as opposed to spinors. We believe that this 
is an unnecessary complication. A general bispinor de¬ 
scribes a superposition of a particle and an antiparti¬ 
cle. But the slow-envelope approximation generally is 
not possible for such an object; then the very concept of 
a point particle remains undefined. Moreover, a particle 
and an antiparticle respond very differently to external 
fields, so they do not travel as a whole. Thus, a point- 
particle model for them may not be physically meaning¬ 
ful, even when it can be constructed formally j52| . Our 
theory avoids this issue by being explicitly restricted to 
pure (i.e., particle or antiparticle) states. Then the in¬ 
ternal state is described by two rather than four complex 
numbers, which also simplifies the analysis. 

2. — In application to a Dirac particle specifically, the 
separation of pure states that we used can be interpreted 
as the Foldy-Wouthuysen transformation 0] expanded 
asymptotically in e. In this sense, our calculation is also 
related to that in, e.g., Refs. 0, [54, 0, but with the 
following difference. In contrast to other papers, and as 
we already pointed out in a different context, our the¬ 
ory is constructed straightforwardly by deduction from 
the quantum LD. We do not postulate additional sym¬ 
metries, Poisson brackets, or any other correspondence 
between quantum and classical dynamics except the GO 
limit. The state function if does not need to be related to 
probability in our formulation 0 ], and thus expectation 
values do not need to be introduced. (The same applies 
to Weyl symbols, which fact also simplifies the derivation 
conceptually.) In other words, our results are formal and 
do not require an interpretation. 

3. — As a GO theory, our formulation is related to 
the many existing semiclassical theories of Dirac parti¬ 
cles (for instance, see Refs. 0,0) , but it is different 
from those theories. First, typical semiclassical calcu¬ 
lations assume 6 to satisfy the zeroth-order Hamilton- 
Jacobi equation (1521) and thus do not extend beyond what 
we call the prescribed-# model. As explained in Sec. IVI Cl 
this model is generally inadequate at large times, and 
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that is the reason why standard semiclassical expansions 
fail to capture the SG force. A related discussion can be 
found, for example, in Refs. [HI, .25]. Second, our theory 
is Lagrangian and, as such, leads to manifestly conserva¬ 
tive equations. The Lagrangian formulation is also what 
allows us to introduce a classical theory unambiguously 
(see also Ref. E3) and keep the equations manifestly con¬ 
servative. Moreover, the general theory presented in this 
paper can be readily reformulated to treat also rapidly 
oscillating quasiperiodic fields, in which case U becomes 
a ponderomotive potential. For that, one needs only to 
replace the assumed Eq. (II 131) for the zeroth-order fun¬ 
damental matrix, S, with a more appropriate expression 
such as a Volkov solution Hi. (Ezi- We leave reporting 
such calculations to future publications. 

/. — Finally, and most importantly, our theory is not 
restricted to Dirac particles. It is applicable, in fact, to 
general waves, both quantum and classical. Below we 
discuss this in more detail. 


IX. DISCUSSION 

A related discussion where polarization effects are ex¬ 
plored for general waves can be found in Ref. ;B}. An 
expression for the energy akin to our D [with U given by 
Eq. (1441) ] also appeared there. However, there are impor¬ 
tant differences between our work and Ref. [B|, which are 
as follows. First, the physical meaning of the mentioned 
term is not identified in Ref. [B|; the term is said to have 
“no name”. In contrast, we identify 0 unambiguously as 
the (generalized) SG Hamiltonian, when we study Dirac 
and Pauli particles as a special case. Second, and most 
importantly, Ref. [@] is restricted to adiabatic dynamics. 
This means that the action is assumed to be conserved in 
each mode individually, so the analysis in Ref. [B} is inap¬ 
plicable to systems with degenerate spectrum, including 
Dirac particles. For the same reason, the many existing 
studies 0 on the Berry phase effect in the context of a 
Dirac particle (and beyond) are not directly related to 
the subject of our paper. 

Nonadiabatic dynamics, which includes mode conver¬ 
sion, was studied, e.g., in Refs. [Ell, [E§], but in the re¬ 
stricted context of an asymptotic scattering problem and 
only for two-component waves [6(1 . In contrast, our the¬ 
ory introduces mode coupling directly in ray equations 
(Secs. IVII Bl and IVII Cl) and describes simultaneous res¬ 
onant interaction of arbitrarily many modes. (The lat¬ 
ter and the manifestly Lagrangian formulation also dis¬ 
tinguish our work from the already mentioned studies 
jul I12| of transverse EM waves in relatively simple me¬ 
dia.) This is important, for example, for an adequate 
description of autoresonant wave-wave interactions El- 
Other applications of the general theory could be in de¬ 
scribing nonadiabatic polarization dynamics in nonsta¬ 
tionary inhomogeneous plasmas or other media for which 
the existing theories are insufficient. 

Although wave Hamiltonians H may not be known ex¬ 


plicitly in such applications, our theory still can be ap¬ 
plied as follows. Suppose a set of N linear equations of 
the first order in d^. (For example, in plasma physics, 
those can be linearized hydrodynamic equations com¬ 
bined with Maxwell’s equations.) Then, to the zeroth 
order in e, the equation for all field variables combined 
as a vector 'F is readily cast in the form 

+ =0. (128) 

Now suppose that we are dealing specifically with nondis- 
sipative waves. By definition of such a wave E2 , there 
is a variable transformation T if} that casts Eq. (I128D 
in the form m E3 , i.e., with Hermitian a and A. Once 
those matrices are found, one can find also S, v 0 , U, 
and £ (see also Sec. 7.5 in Ref. 0)- Furthermore, the 
underlying formulation can be extended to a multifluid 
description of vector waves; then it becomes applicable, 
for example, to improving ray tracing in plasmas. Dis¬ 
cussions of these and other applications of the general 
theory presented here are left to future publications. 


X. CONCLUSIONS 

In summary, we developed a general first-principle 
eAtheory of resonant nondissipative vector waves. We 
start with deriving the reduced LD that describes cou¬ 
pling of arbitrary N eigenmodes in weakly nonstationary 
and inhomogeneous medium. The coupling term can be 
understood as an effective gauge Hamiltonian of the or¬ 
der of the GO parameter, e. As such, this gauge Hamil¬ 
tonian serves as a generalization of the SG Hamiltonian 
that is commonly known for spin-1/2 quantum particles. 
We show how to parameterize the LD in order to obtain 
dynamical equations for continuous GO waves and also 
for their rays. These equations describe both adiabatic 
and diabatic mode conversion of all N resonant modes 
simultaneously. We also show that the dynamics of the 
Wdimensional complex polarization vector can be rep¬ 
resented as the precession of a real ( N 2 — l)-dimensional 
fixed-length vector, which is interpreted as the wave spin. 
As an example, we apply our theory to derive a classi¬ 
cal model of a Dirac particle. We show that our for¬ 
mulation leads to the well known BMT equations but 
with added SG energy terms. Our calculation is the first 
one that formally deduces these corrected BMT equations 
from quantum theory without postulating any quantum- 
classical correspondence except the GO limit. 

Overall, the advantages of the proposed theory are (i) 
its variational form, leading to manifestly conservative 
equations, and (ii) the fact that the theory assumes no 
specific wave equation and thus treats classical and quan¬ 
tum waves on the same footing. Also, the new theory is 
naturally suited to serve as a stepping stone for study¬ 
ing ponderomotive effects on vector waves and particles. 
Reporting this and other applications of the general for¬ 
malism presented here is left to future publications. 
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Appendix A: Linearization of Hamiltonians with 
respect to k 

1. Hamiltonian representation 

Consider a Schrodinger equation, 

id t A> = H'F, (Al) 

where the Flamiltonian H is a matrix polynomial of 
k = —id x of some order n > 1. (We assume one¬ 
dimensional coordinate space for simplicity, but general¬ 
izations to multidimensional spaces are straightforward 
to apply.) Since the Hamiltonian is Hermitian, it can be 
expressed as 

f\ = (A 0 + A 1 k + ...+A n k n ) H , (A2) 

where A m = A m ,H + iA m ,A are matrix functions of x. 

Let us focus on the term that is of the highest order in 
k. It can be written as 

{A n k n ) H = Qn+Kn, (A3) 

where we introduced the following Hermitian operators, 

Qn = \ (A n , H k n + k n An, H \ (A4) 

Kn = l -[A n ^k n i (A5) 

and [•, •] is a commutator. Notice that !Z n = o(k n ), 

where o(k n ) denotes an operator such that, when ap¬ 
plied to ’P, it contains derivatives of 'F only of orders less 
than n. Then, H = Q n + o(k n ), where we used the fact 
that the remaining terms in Eq. (IA2I) are also o(k n ). 

As a Hermitian matrix, A h .h can be expressed as 
A nt H = U^DU, where U is unitary, and D is diagonal 
with real eigenvalues. Then, H = U'KU + o(k n ), where 

K = i ( Dk n + k n D) (A6) 

is diagonal. Let us represent the eigenvalues of D as 
D q = E g (dg)", where Eg = sgnDg, and d q > 0. Then, 

k = - 2 diagg{Dgfc" + k n D q } = diagg{GgEg} + o(k n ) 

[diaggjAg} denotes a diagonal matrix with eigenvalues 
Ag], where we introduced the scalar Hermitian operators 

Gg = i (dqk + kd q ). (A7) 


Introducing (commuting) matrices G = diag 9 {Gg} and 
E = diaggjEg}, one can further rewrite K compactly as 

K = G"E + o(k n ). (A8) 

Now consider a new variable, if) = . Using Eq. EH) 

and the fact that I/tU is a unit matrix, we obtain a 
Schrodinger equation for t/j, 

idti/j = Hip, (A9) 

where H is a Hermitian operator given by 

H = H + G n E, H = o(k n ). (A10) 


2. Phase space extension 


Let us introduce an auxiliary function 

x = ^GHip, (All) 


where g is a real constant parameter that is yet to be 
specified. In combination with Eq. (1A11I) . Eq. (1 ADI) can 
be represented as the following pair of equations, 

id t ip = Hip + /iG n -\x, (A12) 

0 = p, 2 Y(p- 1 Gi:iP-x), (A13) 

where the operator p?Y must be invertible but otherwise 
can be chosen arbitrarily. Equations (IA12I) and (IA13I) 
can be considered as a special case of the vector equation 



H nG n ~ 1 \ / ip \ 
fxYGY —p, 2 Y J \x J 


(A14) 


in the limit p, —>• oo. Finally, choosing Y = G n 2 E turns 
Eq. (IA14I) into a Schrodinger equation with a manifestly 
Hermitian Hamiltonian operator of order (n — 1) in k: 



H pG n_1 
pG n_1 —p 2 G n_2 E 



(A15) 


This shows that, by extending the state function, one 
can reduce the order of the spatial derivative entering 
the Hamiltonian from n to (n — 1). It is then seen, by 
induction, that one can also reduce it to the first order. 
In the limit p —> oo, the additional dispersion branches 
that emerge due to this procedure correspond to infinite 
frequencies and thus cannot affect the dynamics of ip. 


3. Example 

Let us consider a simple case with n = 2 as an example. 
Specifically, suppose 


H = g 2 k 2 + g, (A16) 
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where g is a real constant, and g = o(k 2 ). Then, K is a 
scalar operator, K = G = gk. This leads to the following 
equation in the extended space, 


V x = Im ( ^a l diPj 


= Im 


id* 


^ \ = ( s 


(A17) 


X ) \ pgk —p 2 J \ X J 

When g is homogeneous, Eq. (IA17I) has eigenvectors 


= Im 

= Im 


m + e 

(I 2 

(Tp 

2e 

m+£ 

m + e 

(I 2 

<T p 

2e 

m+£ 


0 a 1 
cd 0 


0 

O" 7 

m+e: 


diPj 


a a • 

m+e ) diPj 


(o- ■ V)(cr ■ p) 

2e 



_ g— iu)t-\-ikx f j 

(A18) 

= Im 

■(V-p) + i(V xp)-ff 

V xy 

V Xo J 


2e 


where ipo and xo are complex constants. The dispersion 
relation connecting ui with k is 

(A19) 

where g(k) is the eigenvalue of g. This leads to 

wi ,2 = g ^ ) 9 /f ±~ Q{k)] 2 + 4/r 2 H (k), 

where H(fc) is the corresponding eigenvalue of H. In the 
limit p -+ oo, one of these solutions approaches H(fc), as 
intended, whereas the other one corresponds to infinite 
frequency and thus can be ignored. 


Appendix B: Auxiliary calculations 

Here we present auxiliary calculations that are used in 
Sec. IVIIll to derive U for a Dirac particle. 


1. Expressions for Vt, V x , Qt, and Q x 

Using that 


m 

/m + e 

5 f 

' I 2 

d Pj ' 

(/ 2e 

% ' 

£ P 

^ m+£ 


m + e f 0 

V 

\ m-\-£ 


2e 


we obtain the following expressions for Vt and V x : 


' Pl = Imi -§^ dlP ‘ 


= Im 

= Im 
= Im 


m + £ ( TT <r p \ ( 0 

^ — E ) _2f_ 

V m+£ 

(cr ■ p)(cr • d t p) 


dtPj 


2e(m + e) 

(dtp) ■ P + *(P x d t p) • cr 


2 e(m + e) 


v x d t p 
2 (m + e) 


(Bl) 


= ^(V xp )-cr 

= -(VxA)-, 

qB 



-- cr. 

2s 


Using that 


as 


d 

d£ 

I _c^p_ 1 

\ rn+E / 

&£ 


Im + e 



V 2e 



rriE 



(B2) 


2 e(m + e) 


2s 

m 


1 m + e 

a 

/ 2e 

ae 


h 


h 

cr P 


cr P 

( m+e ) 2 


m + e 
2e 


0 

<r-p 


, (m+e) 2 , 

we also obtain the following expressions for Q t and Q x : 
Qt = Im ( S f ^ d t e 


= — Im 

— Im 

= — Im 

= — Im 
= 0 , 


mdt£ 

2 e(m + e) 
m + e 


2 , ( ' 2 S) y^ 


0 


mdt£ 

2 e(m + e) 

P 2 

2 e(m + e) 2 




— Im 


(cr •p)(<r -p) 

2 e(m + e) 2 


9 t e 


d t £ 


(B3) 




,-dE 


= — Im 

— Im 

= — Im 

— Im 


de 

m 


2 e(m + e) 
m + e 


2e 


S^a*!E!0,;e 

0 cr 1 
cr 1 0 




<r-p 1 dj£ 

(m+e ) 2 


2 e(m + e) 

(o- • Ve)(cr ■ p) 

2e(m + e) 


l 2 (v 0 • V)e 


v x Ve 

2 (m + e) 


(B4) 
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2. Anomalous magnetic moment 

Here, we show that 


i^PaavF^ij} = ■ 


■ a) (I 2 


<t p \ fh 0 

m+e> { 0 -I 2 


—cr • B icr ■ E 
icr ■ E —cr • B 


h , 

erp | CL 

m+ey 


t/ir -p x/o- • B - icr • E 

« ( Ia m+e) Ucr-E-cr-B-^E.) ° 

V m+£/ 

B ■ a - iSj£ + (Ex p ) ct \ 
iff ■ E - ^JL - I 

m-\-£ m-\-e / 

B • cr — + (E X P) • a ■ E - ' P B P ia • P ( B >< P) ' °- 



m + e 


m + e 


m + e 


m + em + e m + e m + e 
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(E X p) • cr 

p X E) • cr cr p B p 

ier p (Bxp)-ff 
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m + e 
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m + e m + e 
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a f 

B cr - 

2(p X E) • cr 

cr p B • p 1 (p X 
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m + em + e m + e 
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B cr - 

2(p X E) • cr 

cr p Bp i 1 p 2 (B 

• cr) - (p • B)(p • cr) 

£ 

m + e 

m + em + e m + e 

m + e 


m + £ 


B cr - 

O (p x E) • cr 

O cr ■ P B-P | p 2 (B cr) 
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£ 

m + e 

1 / x n 

m + em + e (m + £) 



= — a f 


f ^ B • cr — 2(v x E) • cr- -£— (cr ■ v)(B • v) H-—— v 2 (B ■ cr) 


L 7 


7+1 


7+1 


= -2a f 


cr • B — (v X E) • cr-(cr • v)(B • v) 

v ’ 7+1 
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